Determining the minimum number of types necessary to represent the sizes of protein atoms

نویسندگان

  • Jerry Tsai
  • Neil Voss
  • Mark Gerstein
چکیده

MOTIVATION Traditionally, for packing calculations people have collected atoms together into a number of distinct 'types'. These, in fact, often represent a heavy atom and its associated hydrogens (i.e. a united atom). Also, atom typing is usually done according to basic chemistry, giving rise to 20-30 protein atom types, such as carbonyl carbons, methyl groups, and hydroxyl groups. No one has yet investigated how similar in packing these chemically derived types are. Here we address this question in detail, using Voronoi volume calculations on a set of high-resolution crystal structures. RESULTS We perform a rigorous clustering analysis with cross-validation on tens of thousands of atom volumes and attempt to compile them into types based purely on packing. From our analysis, we are able to determine a 'minimal' set of 18 atom types that most efficiently represent the spectrum of packing in proteins. Furthermore, we are able to uncover a number of inconsistencies in traditional chemical typing schemes, where differently typed atoms have almost the same effective size. In particular, we find that tetrahedral carbons with two hydrogens are almost identical in size to many aromatic carbons with a single hydrogen. AVAILABILITY Programs available from http://geometry.molmovdb.org. CONTACT [email protected]; [email protected]; [email protected] SUPPLEMENTARY INFORMATION Available at http://geometry.molmovdb.org.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Mixed Integer Programming Approach to Optimal Feeder Routing for Tree-Based Distribution System: A Case Study

A genetic algorithm is proposed to optimize a tree-structured power distribution network considering optimal cable sizing. For minimizing the total cost of the network, a mixed-integer programming model is presented determining the optimal sizes of cables with minimized location-allocation cost. For designing the distribution lines in a power network, the primary factors must be considered as m...

متن کامل

Molecular Dynamics Simulation of Al Energetic Nano Cluster Impact (ECI) onto the Surface

On the atomic scale, Molecular Dynamic (MD) Simulation of Nano Al cluster impact on Al (100) substrate surface has been carried out for energies of 1-20 eV/atom to understand quantitatively the interaction mechanisms between the cluster atoms and the substrate atoms. The many body Embedded Atom Method (EAM) was used in this simulation. We investigated the maximum substrate temperature Tmax  and...

متن کامل

Labeling of Human Serum Albumin with Stable Isotope of Bromine; an in Vitro Study

Background: Possibility to trace-label albumin with isotopes results in information concerning its synthesis, breakdown, and distribution in the intra and extra cellular spaces. The iodination of albumin is a widespread procedure used in scientific studies. Bromine not only is more reactive and less expensive than iodine, but bonds more easily with many elements. Therefore, it could be a suitab...

متن کامل

Determining the Sample size for Estimation of the CCC-R Control Chart Parameters Based on Estimation Costs

In today's highly competitive industrial environment due to fast technology development, quality practitioners will to detect out-of-control situations and take actions whenever is necessary as soon as possible. Accordingly, new statistical procedures have been enhanced incessantly both to handle high yield processes along with looking for methods of minimizing all quality cost. CCC-r chart, th...

متن کامل

Computational studies of carbon decorated boron nitride nanocones

Density functional theory ,(DFT) calculations have been performed to investigate the properties ofcarbon decorated (C-decorated) models of boron nitride (BN) nanocones. To this aim, the apex andtip of nanocone have been substituted by the carbon atoms to represent the C-decorated models. Theresults indicated that dipole moments and energy gaps could reveal the effects of C-decorations onthe pro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Bioinformatics

دوره 17 10  شماره 

صفحات  -

تاریخ انتشار 2001